Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels
نویسندگان
چکیده
= {Probabilistic topic models are important tools for indexing, summarizing, and analyzing large document collections by their themes. However, promoting end-user understanding of topics remains an open research problem. We compare labels generated by users given four topic visualization techniquesword lists, word lists with bars, word clouds, and network graphsagainst each other and against automatically generated labels. Our basis of comparison is participant ratings of how well labels describe documents from the topic. Our study has two phases: a labeling phase where participants label visualized topics and a validation phase where different participants select which labels best describe the topics’ documents. Although all visualizations produce similar quality labels, simple visualizations such as word lists allow participants to quickly understand topics, while complex visualizations take longer but expose multi-word expressions that simpler visualizations obscure. Automatic labels lag behind user-created labels, but our dataset of manually labeled topics highlights linguistic patterns (e.g., hypernyms, phrases) that can be used to improve automatic topic labeling algorithms.},
منابع مشابه
Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Topic Labels
= {Probabilistic topic models are important tools for indexing, summarizing, and analyzing large document collections by their themes. However, promoting end-user understanding of topics remains an open research problem. We compare labels generated by users given four topic visualization techniquesword lists, word lists with bars, word clouds, and network graphsagainst each other and against au...
متن کاملUnderstanding and Using Patterns of Food Labeling Systems and their Determinants by Medical Students of Tabriz University of Medical Sciences, Iran
Background and Objectives: Increased public knowledge concerning roles of nutrition in prevention of non-communicable diseases have urged people to select healthy foods. The aim of this study was to investigate levels of understanding and use of food labeling systems and their determinants by medical students of Tabriz University of Medical Sciences, Tabriz, Iran. Materials and Methods: In a c...
متن کاملEvaluating Visual Preferences of Architects and People Toward Housing Facades, Using Multidimensional Scaling Analysis (MDS)
One of the most important issues that have absorbed the public opinion and expert community during the recent years, is the qualitative and quantitative aspects of the housing. There are several challenges related to this topic that includes the contexts of the construction, manufacturing, planning to social aspects, cultural, physical and architectural design. The thing that has a significant ...
متن کاملMining Adverse Events of Dietary Supplements from Product Labels by Topic Modeling
The adverse events of the dietary supplements should be subject to scrutiny due to their growing clinical application and consumption among U.S. adults. An effective method for mining and grouping the adverse events of the dietary supplements is to evaluate product labeling for the rapidly increasing number of new products available in the market. In this study, the adverse events information w...
متن کاملEvaluating Multimodal Representations on Sentence Similarity: vSTS, Visual Semantic Textual Similarity Dataset
The success of word representations (embeddings) learned from text has motivated analogous methods to learn representations of longer sequences of text such as sentences, a fundamental step on any task requiring some level of text understanding [13]. Sentence representation is a challenging task that has to consider aspects such as compositionality, phrase similarity, negation, etc. In order to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- TACL
دوره 5 شماره
صفحات -
تاریخ انتشار 2017